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SIGNAL PERMUTING 

TECHNICAL FIELD 

This invention relates to signal permuting. 

BACKGROUND 

Signal permuting is used to change the routing paths of signals to connect two interfaces 
whose sets of signal lines have different orderings. For example, as shown in Figure 1 A, if an 
interface 10 to input circuitry 11 has lines L 0 to L 3 that carry Control, Data 1, Data 2, and Ground 
signals, and a second interface 12 to output circuitry 13 has lines L 0 to L 3 that carry Datal, Data 
2, Ground, and Control signals, the input and output interfaces are coupled by a 4-by-4 
permuting network 14 that maps [L 0 , Li, L 2 , L 3 ] to [L 3 , L 0 , Li, L 2 ]. 

A permuting network can be implemented using switches. In Fig. IB, the output of an n\\ 
switch 16 can reflect any one of its n inputs. By using a number of n:\ switches which receive 
the same set of n signals at their inputs, n signals can be permuted to any order at their output 
terminals. Fig. 1C shows a 2-by-2 permuting network implemented using two 2:1 switches 20, 
22 that permutes the ordering of two signals So and Si. 

An n:\ switch can be implemented using an n:\ multiplexer, a 2"xl random access 
memory that has its contents written to reflect a single address input, or an rc-input lookup-table 
in a field programmable gate array that is programmed to reflect any one of its inputs at its 
output. For each type of technology, the types of switches that can be most economically 
implemented may be different. 
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DESCRIPTION OF DRAWINGS 

FIGs 1A-1C are schematic diagrams of switches and permuting networks. 
FIGs 2-3 are schematic diagrams of permuting networks. 
FIG 4A is a diagram of a mapping between input and output terminals, 
FIG 4B is a diagram showing transition of position coordinates of signals. 
FIG 5 A is a schematic diagram of a computer having a permuting network. 
FIG 5B is a diagram of a mapping between the input and output terminals of the 
permuting network of FIG 5 A. 

FIG 6 is a schematic diagram of a permuting network. 

DETAILED DESCRIPTION 

The invention is directed towards construction of a permuting network using a 
predetermined set of building blocks. The building blocks are programmable switches of one or 
more type, each type being able to select one signal from among several signals. The permuting 
network is configured as a multidimensional switch array permuter (MSAP) that can permute the 
ordering of signals by programming the switches to select signals according to a permuting 
algorithm. 

In the figures, each permuter has input terminals at its left and output terminals at its 
right. Referring to Fig. 2 3 a multidimensional switch array permuter (MSAP) 200 that can 
permute N = wixw 2 x. ..xwd signals is implemented using 2D-1 layers of switches, layers Li 
through L2D-1. The numbers w\, W2, w D are integers. Each layer contains the same number (N) 
of switches of the same type, but the types of switches may differ among different layers. For X = 
1 to D, layer includes wx'A switches, each of which can select one signal from among wx 
signals. For example, layer Li contains W\:l switches, layer L2 contains w 2 :l switches, and layer 
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L D contains wtf.l switches. For X = ZHT to 2D-1, layer h x contains (2D-X):l switches. For 
example, layer Ld+\ contains wda'.I switches, and layer L2/H1 contains w\:l switches. By 
appropriately connecting the output terminals of switches in one layer to the input terminals of 
switches in the next layer, and appropriately programming the switches to select input signals, 
MSAP 200 can permute the ordering of a set of signals at its input into any arbitrary ordering of 
the signals at its output. 

The input signals of the switches in layer Li are the N input signals of the MSAP. The 
w\:l switches in the layer Li are configured to form a number of w\-by-w\ permuters 202, the 
number being Nlw\. Each wi-by-wi permuter 202 includes a number of switches, the 
number being w\. Each w>i-by-wi permuter 202 receives w\ signals from the input terminals 204, 
and outputs a set of re-ordered w\ signals on the output terminals 206. The output signals 206 of 
permuters 202 are sent to the input terminals of permuters in layer L2. The w 2 :l switches in layer 
L2 are configured to form a number of W2-by-W2 permuters 208, the number being N/W2. Each w 2 - 
by-u>2 permuter 208 receives w 2 signals at its input terminals 210, and outputs a set of re-ordered 
w 2 signals at its output terminals 212. The output signals of permuters 208 are sent to the input 
terminals of permuters in layer L3, and so forth. 

In general, the switches in layer Lx (A = 1 to D) are grouped into wx-by-wx permuters, each 
permuter receiving wx signals from the previous layer and generating a set of re-ordered wx 
signals at its output. The switches in layer Lx (A = D+l to 2D-1) are grouped into w 2 o-x~by-w 2 D-x 
permuters, each permuter receiving w 2 d-x signals from the previous layer and generating a set of 
re-ordered W2D-A signals at its output. The switch outputs in the last layer, L 2 d-\i are the outputs of 
the MSAP, which is a set of re-ordered N signals. 

The connections between the output terminals of one layer and the input terminals of the 

next layer are fixed (e.g., hardwired). The connections are represented by the regions 214 
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enclosed in dashed lines. The connections between each layer are configured so that each of the 
output signals of a permuter is sent to a different permuter in the next layer. Thus, each wx-by-wx 
permuter receives wx input signals from wx different permuters in the previous layer, permutes 
the order of the wx signals, and sends the re-ordered wx signals to wx different permuters in the 
next layer. 

An advantage of the invention is that the MSAP can be designed and constructed using 
various types of switches, depending on which types are available. For example, to design an 
MSAP that can permute N signals, the factors w t of N are first determined. Usually the number N 
can be factored in many ways, each of which corresponds to a different design. For example, 
suppose the factors w t are chosen so that N = wixw 2 x . . . xw D , then a MSAP having D dimensions, 
with wx being the width of the A-th dimension can be designed. The meaning of "width" and 
"dimension" will be apparent from later descriptions. 

For example, there are several MSAP configurations that can permute 48 signals: 

• A two-dimensional MSAP can be designed by choosing N = 48, D = 2, w\ = 6, 
and w 2 = 8. The 48-by-48 MSAP can be built by using a first layer of 6: 1 
switches, a second layer of 8:1 switches, and a third layer of 6:1 switches. 

• A three-dimensional MSAP can be designed by choosing N = 48, D = 3, w\ = 3, 
w 2 = 4, and w 3 = 4. The 48-by-48 MSAP can be built by using a first layer of 3 : 1 
switches, second, third, and fourth layers of 4:1 switches, and a fifth layer of 3:1 
switches. 

• A five-dimensional MSAP can be designed by choosing N= 48, D = 5, w\ = 2, 
w 2 = 2, w 3 = 2, w 4 = 2, and w$ = 3. The 48-by-48 MSAP can be built using first 
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through fourth layers of 2: 1 switches, a fifth layer of 3 : 1 switches, and sixth 
through ninth layers of 2: 1 switches. 

The advantage is apparent when different environments (e.g., different technologies for 
implementing the switches, or different brands of products of the same technology) provide 
different types of switches, or allow different types of switches to be implemented most 
economically. For example, in one environment, the basic building block may be 2:1 switches, 
and all other n\\ switches are constructed from the 2:1 switches. In another environment, the 
basic building blocks may include both 2:1 and 3:1 switches, and larger switches are built from 
these two types of switches. An JV-by-N MSAP may be constructed using building blocks that are 
the most economical for that particular environment. 

Following the previous example, to permute 48 signals, it is also possible to select a 
configuration of MSAP in which N is chosen to be larger than 48, so that some MSAP inputs and 
outputs are left unused. This structure uses more logic circuits, but may allow certain MSAP 
configurations to be available. For example, by choosing N= 64, the following three 
configurations can be used to construct a 64-by-64 MSAP: 

• AT=64,£> = 2,wi = 8,w 2 = 8. 

• N= 64, D = 3, w\ = 4, w 2 = 4, w 3 = 4. 

• N= 64, D = 6, w\ = 2, w 2 = 2, w 3 = 2, w 4 = 2, w 5 = 2, w 6 = 2. 

In general, for a particular environment, if larger switches have to be built from smaller 

switches, then to permute the same number (AO of signals, a higher dimensional MSAP using 

smaller switches will use less logic circuitry. In the above example, if the 8:1 and 4:1 switches 

have to be built from 2:1 switches, then among the three configurations, the six-dimensional 

MSAP using 2:1 switches is more economical (i.e., will use the smallest number of switches 

overall). Because the dimensional width wu is proportional to the sizes of switches, smaller 
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switches and less logic circuitry can be used by choosing w\ 9 m>2, w& as close together in 
value as possible. For example, when N= n D , the most economical configuration is to choose w\ 
= W2 = . . . = wo= n. For example, if larger switches have to be constructed from 2:1 switches, 
then the most efficient design for MSAP is to choose n = 2 and use 2:1 switches as the basic 
building blocks of the MSAP. See Table 1 below for a comparison of the efficiency of MSAP's 
constructed using switches of various sizes. 

In the example shown in Fig. 3 A, a 12-by-12 MSAP 300 is constructed from two layers 
of 3:1 switches and one layer of 4:1 switches. MSAP 300 includes layers Li to L3. Layer Li 
includes 3:1 switches that are configured to form 3-by-3 permuters 302, 304, 306, and 308. 
Layer L2 includes 4: 1 switches that are configured to form 4-by-4 permuters 3 10, 3 12, and 314. 
Layer L3 includes 3:1 switches that are configured to form 3-by-3 permuters 316, 318, 320, and 
322. Within each permuter, different switches select different input signals so that there is no 
overlap in their selections, resulting in a re-ordering of the input signals appearing at the output 
terminals of the permuter. 

Each 3~by~3 permuter in layer Li receives 3 signals at its input terminals, permutes the 
ordering of the 3 signals, and sends the re-ordered 3 signals to three different permuters in layer 
L2. For example, the first output terminal of permuter 302 is connected to the first input terminal 
of permuter 310. The second output of permuter 302 is connected to the first input of permuter 
312. The third output of permuter 302 is connected to the first input of permuter 314, and so 
forth. In this way, each of the three output signals of a permuter in the layer Li will be re-ordered 
by a different permuter in the layer L2. 

Likewise, each output of a permuter in the layer L2 is connected to an input of a different 
permuter in layer L3. Thus, each of the four output signals of a permuter in layer L2 will be re- 
ordered by a different permuter in layer L 3 . By properly re-ordering signals using each 
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permuters, the 12-by-12 MSAP can re-order 12 input signals into any orderings of the 12 output 
signals. 

As an illustration of how permuters re-order signals, dashed lines are shown inside each 
permuter representing routing of signals from input terminals to output terminals. Twelve input 
signals with the ordering [Si, S2, S12] are re-ordered into ordering [S 7 , S 5 , S3, S9, S 8 , Si, S10, 
S2, S6, S12, Sn, S4] at the output of MSAP 300. The following illustrates how signal S3 is re- 
ordered from line #3 at the input terminal at the left of the figure to line #6 at the output terminal 
at the right of the figure. Permuter 302 re-orders signal S 3 from input #3 to output #1, which is 
connected to input #1 of permuter 310. Permuter 310 re-orders signal S3 from input #1 to output 
#2, which is connected to input #4 of permuter 318. Permuter 3 1 8 then re-orders signal S3 from 
input #4 to output #6. The inputs and outputs of the permuters in each layer are numbered #1 to 
#12 from top to bottom in the figure. 

The following describes the connections between the output terminals of switches in one 
layer and the input terminals of switches in the next layer. These connections are fixed (e.g., 
hardwired). For description purposes, the switches in each layer of the MSAP are said to form a 
D-dimensional array. The switches in layer are denoted as S [X] [x\ , x 2? . . . , *d] , with 1 □ Xk □ Wk, 
k ranging from 1 to £>. The number x\ represents the 1 st dimension coordinate of the switch, the 
number x 2 represents the 2 nd dimension coordinate, and so forth. The inputs to these switches 
will be denoted by adding subscripts to the ends of switch names. For example, switch 
S[l][2,3,4] has inputs S[l][2,3,4]i, S[l][2,3,4] 2 , etc. The outputs will be denoted by adding an 
asterisk (e.g. S[l][2,3,4]*). 

The output signals of the MSAP are the outputs of the last layer L 2 /m, i.e., the MSAP 

outputs are S [2D- 1 ] [x\ , X2, . . . , x D ] * for the various combinations of x\ , x 2? . . . , *z> To simplify 

description of notation for the switches, a fictional layer Lo is used so that the MSAP inputs are 
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the L 0 outputs. Thus, the MSAP inputs are S[0][xi, x 2 , • . to]* for the various combinations of 
xi,x 2 , ...,to. Although there is no switch S[0][1, 1], there is a signal S[0][1, 1]*, which 
is the MSAP input. 

For X = 1 5 2, . . . , Z), the permuters in layer La arbitrarily permute the outputs of the 
permuters in the previous layer along dimension X only. This means that the switches in a 
permuter in layer La only receive signals from switches in the previous layer having labels that 
are different in dimension A, and only permute those signals. Each of the w\\ 1 switches in layer 
Lt(XOD), denoted S [X] [xi,x 2 , ...,to], has w\ inputs, i.e., S[/lj[xi,x 2 , ...,x D ~\u S[AJ[xi, x 2 , 
toK . . •» S[A][xi, x 2 , . . to](^A). These inputs map to all the outputs of the previous layer with 
switch coordinates differing only in dimension X: 

S[X][x u x 2 , ...,xx, ...>to]i = S[X-l][xux 2 , 1, ...,to]* 

S[2][xi,x 2 , ...,xx, ... ? to]2 = S[A-l][xi,x 2 , ...,2, ...,xz)]* 

S[A][Xi,X 2 , ...,XJu - -,X/>](Wji) = S[i-l][Xi,X2, wi, ...,x D ]* 

(1 □ A □ Z); anyxi,x 2? ...,xd) 

For A, = D+l, D+2, . .., 2ZM, layer arbitrarily permutes the outputs of the previous 
layer along dimension 2D A only. This means that the switches in a permuter in layer L\ only 
receive signals from switches in the previous layer having labels that are different in dimension 
2D-X, and only permute those signals. Each of the W2zu:l switches in layer (X > D\ denoted 
S[A,][xi,X2, ...,to],has ww-x inputs, i.e., S[X][xi,x 2 , x D ]u S[X][x h x 2? ...,x D ] 2 , S[A,][xi,x 2 , 
. . ., xo](W2D-xy These inputs map to all the outputs of the previous layer with switch coordinates 
differing only in dimension IDA: 

S[X][Xi,X 2 , ...,X2WU ...,to]i = S|>l][xi,x 2 , 1, ...,x D ]* 

S[A][Xi,X 2 , ...,X 2Ml , ...,to]2 = S[Arl][xl,X 2 , ...,2 5 ...,X/)]* 

-8- 
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$[X][x U X 2 , ..-,X2D4 a ••-,X D ](W2D-X) = S[X-l][XuX 2 , . . . , M>2Z>-X, 

(D+l □XD2D-l;anyjci ? x 25 . ..,*/)) 

Referring to Fig. 3B, the switches in MSAP 300 have been labeled as S[AJ[xi, x 2 ], X 
ranging from 1 to 3, Xi ranging from 1 to 3, and x 2 ranging from 1 to 4. For convenience of 
notation, the MSAP input signals are described as the output signals of layer Lo, and are labeled 
as S[0]D j]*, i ranging from 1 to 3 5 and j ranging from 1 to 4. Layer Li permutes the signals 
along the first dimension. This means that the switches in layer L 0 , whose outputs are connected 
to the inputs of a switch in layer Li, have labels with different coordinates in the first dimension 
but same coordinates in the second dimension. For example, the three inputs of switch S[l][2,3] 
are connected to outputs of switches in layer Lo as follows: 

S[1][2,3]! = S[0][1,3]* 

S[1][2,3] 2 = S[0][2,3]* 

S[1][2,3] 3 = S[0][3,3]* 

Layer L 2 permutes the signals along the second dimension. This means that the switches 
in Li, whose outputs are connected to the inputs of a switch in L 2 , have labels with different 
coordinates in the second dimension but same coordinates in the first dimension. For example, 
switch S[2][2,3] has its four inputs connected as follows: 

S[2][2,3h = S[l][2,l]* 

S[2][2,3] 2 = S[1][2,2]* 

S[2][2,3] 3 = S[1][2,3]* 

S[2][2,3] 4 = S[1][2,4]* 

Layer L3 permutes the signals along the first dimension. For example, switch S[3][2,3] 
has its three inputs connected as follows: 
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S[3][2,3]i = S[2][l,3]* 
S[3][2,3] 2 = S[2][2,3]* 
S[3][2,3] 3 -S[2][3,3]* 

The output signals of MSAP 300 are the 12 switch outputs from layer L 3 , i.e., S[3][l,l]* 
through S[3][3,4]*. 

An MSAP switch topology has a recursive structure. If the first and last layers of 
switches (Li and L 2 zm) are removed from an MSAP of dimension D > 1, the remaining portion is 
a number of smaller MSAP's of dimension D-l, the number being wi. In general, in an MSAP 
with dimension D > 1, for each fixed coordinate c between 1 and w\ in the first dimension, the 
subset of switches {S[A,][c,x 2 ,x 3 , -~> x d]} (A, ranging from 2 to 2D-2 and each x k ranging from 1 
to w k ) forms a sub-MSAP of dimension D-l with dimensional widths w 2 , . . ., wd. Each sub- 
MSAP has its own sub-MSAP's, and so forth, down to the level of one-dimensional MSAP's. 
This recursive structure of the MSAP is reflected in the MSAP switching algorithm described 
below. 

The switches in an MSAP are configured according to a switching algorithm so that the 
MSAP can permute signals in any chosen order. To facilitate description of the algorithm, each 
input signal to the MSAP is described as having a starting position and a target position. The 
position of a signal in a particular layer is written as [x\, *2, - - - , xd], and corresponds to the label 
of the switch that selected the signal as output. For example, in Fig. 3B, the signal at input line 
#3 has a starting position of [3,1] because the signal is selected by switch S[0][3,l] and reflected 
at its output. As described previously, layer L 0 is a fictional layer and used just for convenience 
of description. When the signal propagates to layer Li, its position changes to [1,1] because the 
signal is selected by switch S[l][l,l]. When the signal propagates to layer L 2 , its position 
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changes to [1,2] because the signal is selected by switch S[2][l,2]. Then in layer L 3 , the position 

of the signal changes to [3,2] because the signal is selected by switch S[3][3,2]. 

In a three dimensional (2>=3) MS AP, suppose an input signal has a starting position of 

[2,5,3] and a target position of [4,1,6]. This means that the MSAP switches must be set in a 

manner so that the input signal S[0][2,5,3]* is routed to the output signal S[5][4,l,6]*. As a 

signal propagates through the MSAP, the signal's position is adjusted in each switch layer, 

reflecting the switches that the signal has propagated through. Because each layer only permutes 

the signals in one dimension, only one coordinate of the signal's position is adjusted in a layer. 

For example, in the D=3 MSAP, a signal may follow the position path: 

L 0 : [2,5,3] 
Li: [8,5,3] 
L 2 : [8,7,3] 
L 3 : [8,7,6] 
L 4 : [8,1,6] 
L 5 : [4,1,6] 

This position path determines how one switch in each layer must be set. In order that the 
signal propagates from L 0 position [2,5,3] to Li position [8,5,3], switch S[l][8,5,3] must be set to 
position 2. In order that the signal propagates from Li position [8,5,3] to L2 position [8,7,3], 
switch S[2][8,7,3] must be set to position 5, and so forth. When all of the position paths for the 
MSAP's input signals have been determined, all of the switches in the MSAP can be set so that 
all input signals are routed to the proper output terminals of the MSAP. Thus, a collection ofN 
position paths (or signal paths) can be used to determine a switch-setting configuration for an 
MSAP. 
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The MSAP Switching Algorithm 

The MSAP switching algorithm provides a way of configuring an MSAP to route input 
signals to achieve every possible signal permutation at the output terminals. However, the 
configuration provided by the switching algorithm is not the only way that an MSAP can be 

5 configured to permute signals. It is possible to use a "brute-force" method by looking through all 
possible switching combinations in some intelligent (e.g., recursive) manner to find a solution. 

The MSAP switching algorithm includes Algorithm 1, which calls Algorithm 2. In the 
description of Algorithm 1 below, the MSAP switching procedures are described in terms of 
signal position paths rather than in terms of MSAP switch settings. There are N signals, each of 

1 o which has a D-dimensional starting position and a D-dimensional target position. Algorithm 1 

determines the position paths for all the signals, giving each signal's position coordinates in each 
layer from L 0 to L 2 zm in a manner that is allowed by the MSAP topology. No two signals can 
occupy the same position in any layer. The Lo position of each signal is its starting position. The 
L 2 £-i position of each signal generated by Algorithm 1 will match the signal's target position. 

15 

Algorithm 1 

Algorithm 1 calls itself recursively. Each time Algorithm 1 is called, a parameter S that 
represents a subset of the input signals is passed on to Algorithm 1 . When Algorithm 1 is called 
initially, parameter S = {all the signals} is passed to Algorithm 1 . In subsequent recursive calls, 
20 parameter £ will contain successively smaller subsets of signals. Algorithm 1 keeps track of the 
depth of its recursion by passing a level parameter 1. When Algorithm 1 is initially called, 
parameter X = 1 is given to the algorithm. When Algorithm 1 calls itself again, parameter X = 2 is 
passed, and so forth. 
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For any given N, D 9 w\, W2, . . and w D such that N= w\ x W2 x . x wd, Algorithm 1 
performs the steps below using the parameter X to represent the recursion depth and J to 
represent the subset of signals to operate on. Initially, set X=\. 

Step 1 : If X = D 9 jump to step 5. 

Step 2: Copy the Lx-i position of each signal in I to its L% position. 

Step 3: Call Algorithm 2 to permute the Lx positions of signals in E, changing only the X th 
position coordinate of each signal, in such a way that no two signals in E with the 
same X th position coordinate have identical (X+l) st to target coordinates. 

Step 4: For each c, ranging from lto w\, recursively call Algorithm 1, passing parameters: 
S = the subset of signals in Z which have X th position coordinates equal to c in 

layer U,; and 
X = X+l. 

Step 5: Permute the signals in S from their positions in L20-XA to new positions in Lid-x, 
by changing the X th position coordinate of each signal to its X th target coordinate. 

The permutations performed in Steps 3 and 5 of Algorithm 1 permutes signals only along 
a particular dimension allowable for that layer. For example, the switches in layer Lk permute 
signals only along dimension k. Algorithm 1 also follows the rule that no two signals occupy the 
same position in a given layer. This rule can be ensured because Algorithm 2, as described 
below, moves signals by swapping pairs of signals, thereby preventing collisions of signals (i.e., 
two signals being selected by the same switch). 

Algorithm 2 

The following is a description of Algorithm 2 called by Step 3 of Algorithm 1 . Due to the 

recursive nature of Algorithm 1, in layer U, all the signals in S will have the same 1 st to (A,-l) st 

position coordinates. Their X th to 0 th position coordinates, on the other hand, will run through all 

- 13- 



Attorney Docket: 10559-635001 / P12330 

possible combinations. Thus, although each signal still has D position coordinates, 2 is 
effectively a (Z)-A+l)-dimensional array of signals according to positions in layer L\. The X th 
position coordinate of each signal in 2 determines the "slice" of this array that signal is in. The 
(1+1 ) st to D th position coordinates of each signal determine the "column" of the array that the 
signal is in. The definitions of "slice" and "column" are given later. Two signals with the same 
X th position coordinates are in the same slice, and two signals with identical (1+1 ) st to D th 
position coordinates are in the same column. Thus, the array of signals in 2 is divided into wx 
different slices, numbered 1 through w\. The array is also divided into w\+\xw\+2><- ..xw^ 
different columns, not numbered. Step 3 of Algorithm 1 only changes the X th position coordinate 
of each signal so that a signal only moves within its column. 

The following is a description of the terms "slice" and "column." A slice is a subset of 
the available position coordinates containing subsets of 2 of signals that are processed by 
Algorithm 2. For example, consider an MSAP with D = 5, w\ = 6, w 2 = 5, w 3 = 4, w 4 = 3, and 
w 5 = 2. In this case, a total of 6x5x4x3x2=720 signals are permuted, each of which has 
5-dimensional starting position coordinates and 5-dimensional target position coordinates. For 
example, the signal starting in one position [1,1,1,1,1] may have a target position [6,5,4,3,2]. 
Suppose Algorithm 2 is called when 1 = 3, representing the 3 rd level of recursion in Algorithm 1. 
In this case, 2 is a subset of signals with constant L 2 position coordinates in dimensions 1 and 2. 

For example, all the signals in 2 may have L 2 position coordinates of the form [4,2, , , ], so 

that 2 contains exactly 4x3x2 = 24 signals: [4,2,1,1,1], [4,2,4,3,2]. 

In Step 2 of Algorithm 1, the L 2 positions of the signals in 2 are copied to their L3 

positions, so the L 3 positions of the 24 signals are also of the form [4,2, , , ]. Algorithm 2 is 

called to rearrange the 24 signals in 2 within layer L 3 , i.e., Algorithm 2 changes the L 3 position 
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coordinates of the 24 signals in E. Algorithm 2 divides the 24 signals in E into 4 "slices" and 6 
"columns" based on their position coordinates. Each of the 4 slices have 6 signals, and each of 
the 6 columns have 4 signals. 

The 4 slices contain those signals with a given 3 rd (since X = 3) L 3 position coordinate. 
There are 4 slices because X = 3 and w 3 = 4. The slices are called "slice 1 " through "slice 4." 
Slice k contains those signals in E whose 3 rd L 3 position coordinate is t For example, slice 3 
contains 6 signals whose L 3 position coordinates are [4,2,3,1,1], [4,2,3,1,2], [4,2,3,2,1], 
[4,2,3,2,2], [4,2,3,3,1], and [4,2,3,3,2]. During this call to Algorithm 2, the signals in slice 3 have 
these six L 3 positions, although the particular signals occupying those positions may change. 

When Algorithm 2 swaps two signals, those two signals will exchange slices, meaning 
that they will swap their 3 rd Imposition coordinates. The 6 columns, meanwhile, contain those 
signals with a given combination of 4 th and 5 th L 3 position coordinates (the coordinates above 
X = 3). Thus, there is a column whose signals have L 3 position coordinates [4,2,1,1,1], 
[4,2,2,1,1], [4,2,3,1,1] and [4,2,4,1,1]. There is another column whose signals have L 3 position 
coordinates [4,2,1,3,2], [4,2,2,3,2], [4,2,3,3,2] and [4,2,4,3,2]. The term "column," as used in this 
example, describes a set of L 3 position coordinates with their 4 th and 5 th dimension coordinates 
fixed. Thus, signals with L 3 position coordinates [4,2,2,3,1] and [4,2,4,3,1] are said to be in the 
"same column" because their 4 th and 5 th coordinates [3,1] match. In Algorithm 2, a column 
always retains the same set of signals after permutation because the swapping action only 
exchanges two signals in the same column. 

Having described the definition of the terms "slice" and "column" the following is a 
description of the steps performed by Algorithm 2: 

l.FORS^lTO w x -lDO: 
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2. WHILE slice S contains at least 2 signals with identical (X+l) st to D th target coordinates 
DO: 

3 . SET X 0 = one of these signals 

4. FOR£=0, 1,2, ...DO: 

5 5. Look for a signal Z in slices 5+1 through and in the same column as 

X K (selected in Step 3 or 6) with the following property: Z's 
(k+l) st to D th target coordinates are not identical to the (A,+l) st to 
D th target coordinates of any signal in slice S. IF found THEN 
BREAK from the "FOR KT loop. 
10 6. Select a pair of signals signal Y KH and X K+i with the following properties: 

y ; r^ +I is in slices £+1 through and in the same column as Xq, X u 

5 ...,orJfo 

Hi 

[7; X^+i is in slice S, and is different from X 0 , X b . . ., and J^; 

+; Y K +\ and Z^+i have identical (X,+l) st to D th target coordinates, 

jr ; 1 5 There will always be such a pair of signals. 

O 7. END FOR K 

U 8. LOOP: 

jj; 9. Swap signal Z with signal ^ 

Sj 10. IF K = 0 THEN BREAK from the LOOP 

p 20 11. SETZ=}^ 

12. Z is now in the same column as some Xj. SET K to this J. 

13. END LOOP 

14. END WHILE 

15. END FOR S 

25 

Algorithm 2 works by "fixing" one slice at a time, i.e., once a slice is fixed, its signals are 
not changed in successive calls to the algorithm. The WHILE condition of step 2 checks the 
current slice for precisely the property that Algorithm 2 is trying to eliminate. When this property 
is no longer found, that slice is fixed. A fairly small number of times through Steps 3-13 always 
30 fixes a slice. 

The goal of Steps 3-13 is to move a signal with "duplicate" (X+lf to target 

coordinates out of layer 5, replacing it with some other signal from above layer S whose (X+ l) st 
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th 

to D target coordinates are missing from layer S. In the ideal case where these two signals are in 
the same column, Algorithm 2 just swaps the two signals. However, such a pair of signals may 
not be in the same column, so a chain of swaps are performed, each of which operates within a 
different column. The first swap operation moves a signal with missing (1+1 ) st to D th target 
coordinates into layer S, and the last swap operation moves a signal with duplicate (A,+l) st to D th 
target coordinates out of layer S. 

Fig. 4A shows an example of a mapping between the input and output terminals of 
MSAP 300. Fig. 4B shows an example of how the position coordinates of the input signals of a 
12-by-12 MSAP are changed from their starting position coordinates to their target position 
coordinates. The changes in position coordinates were determined using Algorithm 1 and 
Algorithm 2 above. Layer Li only changes the coordinates of the first dimension, layer L 2 only 
changes the coordinates of the second dimension, and layer L 3 only changes the coordinates of 
the first dimension. 

An advantage of an MSAP is that it efficiently uses pre-selected switches as basic 
building blocks. The simplest and most efficient MSAP's use only n:l switches (n being an 
integer) to permute N = rP signals. In such a case, there are 2D-X layers, each containing N 
switches. Because D = log^V, there is a total of Nx(2 log^V-l) ml switches. Therefore, the 
amount of logic circuitry required to construct an MSAP using pre-selected switches with fixed 
switch sizes is on the order of N log N. 

A comparison of the efficiency of MSAP's constructed using switches of various sizes is 

given below in Table 1. Here, it is assumed that larger switches are built from smaller switches. 

Table 1 shows that an MSAP using larger switch sizes requires more logic circuitry. For 

example, if 2:1 switches are available, then building an N=64 MSAP is the most efficient using 

eleven 6-dimensional layers of 2:1 switches. Using five 3-dimensional layers of 4:1 switches 
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requires 36% more logic circuitry. Using three 2 -dimensional layers of 8:1 switches requires 
91% more logic circuitry. Using one one-dimensional layer of 64:1 switches (effectively a set of 
parallel multiplexers) requires 473% more logic circuitry. 



Table 1 











2:1 switches 


Equivalent 


Ratio 






nil switches 




Total n:l 


comprising each 


total 2:1 


vs. 


Difference 


n 


per layer 


Layers 


switches 


w:l switch (= n-l) 


switches 


n=2 


vs. n=2 


2 


64 


11 


704 


1 


704 


100% 


0% 


4 


64 


5 


320 


3 


960 


136% 


36% 


8 


64 


3 


192 


7 


1344 


191% 


91% 


64 


64 


1 


64 


63 


4032 


573% 


473% 



An application of an MSAP is to permute signals between a motherboard and a daughter 
board in a computer. Referring to Fig. 5 A, a computer 500 includes a daughter board 506 
coupled to a motherboard 502 through an interface 504 that has signals lines #1 to #12. The 
daughter board 506 includes a memory 512 that can be accessed by motherboard 502. Daughter 
board 506 and motherboard 502 are manufactured by different companies so that motherboard 
502 sends signals on the signal lines of interface 504 in an order that is different from the order 
recognized by memory 512. 

Daughter board 506 includes a permuting network 508 that has input terminals connected 
to interface 504 and output terminals connected to an interface 510, which in turn is connected to 
memory 512. Permuting network 508 routes the signals in the signal lines of interface 504 to the 
signal lines of interface 510 with a new ordering that is acceptable to memory 512. The mapping 
between signal lines of interface 504 and signal lines of interface 510 is shown in Fig. 5B. As an 
example, the signal si on line #1 of interface 504 is mapped to line #7 of interface 510, and 
signal sl2 on line #12 of interface 504 is mapped to line #4 of interface 510. Similarly, the 
signals sent from memory 512 are re-ordered by permuting network 508 and then passed on to 
motherboard 502. 
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Permuting network 508 is implemented using a field programmable logic array (FPGA). 
The FPGA includes configurable logic components, each of which can be programmed to 
receive a number of inputs and to pass any one of the inputs to its outputs, just like an n:l switch. 
When computer 500 is first booted, daughter board 506 communicates with motherboard 502 to 
determine the required permutation. A processor (not shown) on daughter board 506 executes an 
MSAP switching algorithm to program the configurable logic components to construct an MSAP 
that achieves the required permutation. 

In the example shown in Fig. 6, a 18-by-18 MSAP 600 is constructed from four layers of 
3:1 switches and one layer of 2:1 switches. In this example, N = 18 = 3x3x2, i>3, w>i=3, W2=3, 
and w 3 =3. MSAP 600 includes layers Li to L 5 . Layers L u L 2 , L 4 , and L 5 include 3:1 switches that 
are configured to form 3-by-3 permuters. Layer L 3 includes 2:1 switches that are configured to 
form 2-by-2 permuters. Within each permuter, different switches select different input signals so 
that there is no overlap in their selections. This results in a re-ordering of the input signals 
appearing at the output terminals of the permuter. 

In Fig. 6, the switch labels are shown inside the permuters. The three switches in a 
permuter 602 in layer Li have the same 2 nd and 3 rd coordinates but different 1 st coordinates. 
Thus, permuter 602 permutes the position coordinates of input signals in the first dimension. 
Likewise, other permuters in layer Li also permute the position coordinates of input signals in 
the first dimension. The three switches in a permuter 604 in layer L 2 have the same 1 st and 3 rd 
coordinates but different 2 nd coordinates. Thus, permuter 604 permutes the position coordinates 
of input signals in the second dimension. Likewise, other permuters in layer L 2 also permute the 
position coordinates of input signals in the second dimension. 

In a similar manner, permuters in layer L 3 permute the position coordinates of input 

signals in the third dimension. Permuters in layer L 4 permute the position coordinates of input 
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signals in the second dimension. Permuters in layer L5 permute the position coordinates of input 
signals in the first dimension. Each of the output signals of a permuter in layer Li is sent to a 
different permuter in layer L 2 , each of the output signals of a permuter in layer L 2 is sent to a 
different permuter in layer L3, and so forth. 

MSAP 600 is designed to allow any of the 18 input signals to be routed to any of the 18 
output terminals. In operation, the MSAP switching algorithm is performed to determine the 
transition paths for the 1 8 input signals. The transition paths go through the five layers of 
switches in a way such that each signal is routed to the desired output terminal and that no two 
signals occupy the same switch. The switches are then programmed to select input signals 
according to the signal paths determined by the MSAP switching algorithm. 

Other embodiments are within the scope of the following claims. For example, the 
switches may be implemented using devices other than an FPGA. The switches may be 
programmed by a processor on the motherboard rather than by a processor on the daughter 
board. The configuration of switches (i.e., which signal is selected from its inputs and reflected 
at the output) may be dynamically programmable, or be programmable once and then fixed 
permanently. An MSAP that performs bit permutations can be used as a building block of an 
encoder/decoder that encrypts/decrypts messages. The MSAP can be used to construct 
telecommunication networks to route signals from nodes at one location to nodes at another 
location in a non-blocking manner. The MSAP can also be used in massively parallel or 
supercomputing applications to route signals among different processors. 
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